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ABSTRACT 


Consider  a  random  vector  (X^jX^)  distributed  as  a  bivariate  normal 
with  mean  vector  zero,  and  dispersion  matrix  T,  -  ((o^)).  Suppose  we 
are  given  samples  of  sizes  n^  and  n^»  respectively,  from  the  marginals  of 
X^jX^,  and  a  sample  of  size  n3  from  the  bivariate  population  of  (X^X^). 
Suppose  the  problem  is  to  obtain  a  good  estimator  of  £  based  on  the  above 
(incomplete)  sample.  In  this  paper,  four  estimators  of  Z  are  compared 
using  Monte  Carlo  methods,  and  it  is  found  that  a  certain  relatively 
simple  estimator  of  I  is  the  "best"  or  close  to  the  best  in  almost  all 
situations . 
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1.  Introduction.  Consider  the  problem  outlined  in  the  summary.  In  the  literature 
there  are  proposed  many  methods  for  dealing  wirh  this  situation,  which  is  also 
known  as  the  case  of  missing  observations  in  multivariate  statistics.  Four 
such  methods  are  compared  in  this  study.  The  first  was  proposed  originally  by 
Wilks  (1932)  for  the  case  of  two  responses,  and  later  generalized  by  Kleinbaum 
(1970)  to  an  arbitrary  number  p  of  responses ,  and  to  a  more  general  design  for 
the  location  parameters.  The  second  method  is  a  variant  of  the  first  one.  The 
The  third  method  is  due  to  Hocking  and  Smith  (1968).  The  fourth  one  corresponds 
to  the  principle  of  maximum  likelihood  (m.l.).  "me  tneoreticai  evaluation  of  the 
optimality  properties  of  these  estimators,  particularly  the  third  and  the  fourth, 
seems  to  be  cumbersome,  except  for  large  sample  properties,  such  as  consistency 
and  asymptotic  efficiency.  But  our  concern  here  is  with  the  more  relevant  situa¬ 
tions  when  the  sample  sizes  are  not  necessarily  of  the  order  needed  for  invoking 
asymptotic  properties.  For  this  reason  we  resorted  to  Monte  Carlo  simulation 
techniques . 
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2.  The  Four  Estimators  of  £.  The  four  estimators  E^(i  -  1,2, 3, 4)  of  F  will  now 
be  spelled  out  in  detail.  Let  S^(i  =  1,2)  denote  the  sample  of  size  n^  available 
from  the  marginal  distribution  c  '  the  ith  response,  and  let  S3  denote  the  (complete) 
bivariate  sample  whose  size  is  n  .  Also,  s  and  s  will  symbolize  the  mean 

o  XXX  XXdJ 

square  of  the  observations  on  the  first  response  from  and  S^,  respectively. 
Similar  is  the  definition  of  s222  and  s223.  The  mean  cross-product  of  the  first 
and  second  response  over  the  units  of  S^,  is  denoted  by  s^^.  Finally,  let  r  and 


p  be  equal  to  s^/  ’/s113s223  and  °12^  l^ll°22  ’  resPectively • 


‘  ^  123  113  223  “ - 12' 

(i)  Estimator  E^  =  ((d^))  is  given  by 


A  _  nlSlll  +  n2S222  *  _  A  A  n2S222  +  n3S223 

(2-1)  hi - 5-rrc; — -  •  h2  -  hi  -  si?3-  "22  =  — —  • 

Note  that  E.,  is  not  necessarily  nonnegative  definite. 

(ii)  Estimator  E2  =  ((ck!))  is  given  by:  01;L+  =  d.^,  c>22+  =  d 22>  and  a12+ 
r  y/  dn®22  ‘  This  is  positive  definite  with  probability  1. 

jfc 

( iii)  Estimator  E  =  ((a..)).  This  is  given  by 
_ t) 


(2.2) 


hi  *  hj  +  Bij<a22  -  S222>’  "hel,e 


(2.3) 


(2.4) 


d.  .  =  s.  .„  +  a.,(s  -  s,..  )  , 

13  133  i]  113  111 


(a  ,a  ,a  ) 
11’  12’  22 


ni  \  I  S123  S123 

"l  *  ”3  ’S113  ’  s?..  '' 
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(2.5)  C12 


2  2 
n3  n2 


n3(nl  +  n3} 


2812 

“l  +  n3 

-1 

x 

23  a 

12  22 

2ni 

3  3 
12 

3ii2J 

n3 

n3(Vn3: 

)311 

23222 

2ni 

3124 

n  (n  +  n  )  ,  2 

31  3  °11 


(iv)  Estimator  =  ((&..)),  This  is  the  maximum  likelihood  estimator.  To 
derive  it,  let  L  denote  the  likelihood  function  of  the  total  sample.  Then 

L-i  J  f\  „  1 


(2.6) 


3  log  L 


n  11 


0  0 


0  0 

0 


022 

-012‘ 

a_2S 

11  111 

o' 

12 

°U 

2 

0 

0 

a_1S 
22  222 


a22S113"  2°12a22S123  +  °12S223 


-a  as  +  (a  +aa)S  -aaS 
12  22  113  '  12  11  22  123  12  11  223 


_ai2a22S113  +  (CT12  +  °11CT22)S123  "  ai2°llS223 
°11S223  '  2ai2ailS123  +  °12S113 


Let  911  ~  S113°ll’  612  =  S123ai2’  922  =  S223°22’  fl  "  S111S113’  f2  =  S222S223* 
Substituting  these  in  the  likelihood  equation  obtained  by  equating  3  log  L/ 

3 Z  to  zero,  and  doing  some  simplification,  we  arrive  at  the  following  set  of 
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three  equations  that  are  linear  in  the  parameters  8.. 


[n^l-p  )  Vn3]9ll"2n3P  012+n3P  622  =  (1-p Hr^U-p )  +  n3]  , 

2 

n3P"6ll  ”  2n3p2012  +  Cn2(1'p2)  f2  +  n3]022  =  (l-P^Cn^l-p2)  +  n  ] , 


911  - 

(1  + 

P  >012 

These 

give 

(2.7) 

/•» 

911 

=  A_1[ 

(2.8) 

022 

=  A_1[, 

(2.9) 

012 

=  (  -  A  ~ 

(2.7)  8i;l  =  A  [n1n2(l  -p  )f2  +  "P  ^f2  +  n2n3P  +  nin3  +  n3 

(2-8)  022  =  A  1[n1n2(l  -  p4)^  +  n1n3(l  -  p2)^  +  n^p2  +  n2n3  +  n^l, 

( <■>  n\  a  _  /  *-l\  _  /  i  _2»_  ~  *  r-  t  1 


n  =  ni  +  n2  +  n3’  and  A  =  nin2(1  ”  p  ^flf2  +  nln3fl  +  nen3f2  +  n3*  * 

Thus,  we  obtain  =  s.^3  /  0^.»  (i,j  =  1»2).  Also,  by  invoking  the  invariance 

property  of  maximum  likelihood  (m.l.)  estimation  we  find  that  fhe  m.l.  esi  Lmator 

.2  2 

P  of  p  must  satisfy 

lo  *  o  *  ft  _i  *  _o  *  s  p 

(2.10)  0  =  <°12> !(”u«a)  =  <V  (eu<.22)r2  . 

The  above  implies  that  the  m.l.  estimator  of  I  is  a  positive  definite  matrix  with 

probability  one.  We  also  get 

2-2  s  ;  2 
P  912  -  0lie22  r  =  °* 

Substituting  the  values  of  022  and  0^o  from  (2. 7-2. 9),  we  obtain,  after  some 

simplification,  the  equation; 
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.11) 


10,  2  2,  2  2.  8r  2  2 


♦  P°[n1^n2^f1f2(-4f;Lf2+2f1i-2f2-r/)]  +  (6f  ^  f  ^ -6f  ^  f  7 

_6flf22  +  f/  +  f22  +  Zflf2)  ‘  2nin2n3nflf2  '  nin2n3r2(niflf2  +  n2flf2  '  Vl^lV1 

+  °4  (ni2n22(-4fl2f22  +  6fl2f2  +  6flf22  "  2fl2  -  2f22  '  4flf2)  +  2n1n2n3n(2flf2'ffV 

-t?n1n2[(-2n1n2  -  -n^  t  n^f^  t  n^  -  Zn^  t  n3(-n2  -  Sn^tn^} 

P2  (n12n22(f12f22  -  2fl2f2  -  2flf22  +  f/  +  f/  +  2fxf2)  -  Zn^nCf,^  -  f,  -  f?) 

+  n32"2  -  r?Cn1n2n3f1f2(-n1  -r>2  -2n3)  +  n^f.in/  t  -  n/ )  + 

2  2  2  2  2  2 
+  n^r>3  -  n2n3  -  n3  )  t  n3  (nx  +  )  +  n^ln^  +  n2>]}  -  r  (n.^  +  n  n3  +  n2n3  +  n3  ) 

(nlVlf2  +  nln3fl  +  n2n3f2  +  n3?)  1  °- 

2  o 

Let  f(p  )  denote  the  left  hand  side  of  (2.11).  Then  f(p")  =  0,  is  a  fifth 

2  ... 

degree  polynomial  equation  in  P  with  stochastic  coeff  jcten-’-s ,  and  every  root  it 
has  between  zero  and  one  can  be  ;ubstituted  in  (2. 7-2.0),  leading  to  a  set  of 
maximum  likelihood  estimators  a,,,  o  and  a  .  Finding  (closed)  exact  solutions 

for  the  above  polynomial  does  net  seem  to  be  an  easy  task.  However,  our  n- ^gram 


of  the  Monte  Carlo  study 


all  explain  later)  is  design''’  to  evaluate, 


by  a  simple  iterative  method,  the  roots  of  this  equation  which  lie  in  the  interval 

(0,1),  to  any  pre-assiened  degree  of  precision.  Nevertheless,  at  this  point  we 

2 

can  make  the  following  observations.  If  we  put  p  *  1 ,  we  find  after  some 

2  2  2 

simplification  that  f(l)  -  n3  n  [1  -  r  ]  >  0 ,  with  probability  one.  Similarly, 
we  find  that 

2  2  _ 

f(C)  -r  +  fy'j  +  n3  +  nintri  +  +  n3  1  < 


From  this  we  conclude  that  (2.11)  has  at  least  one  root  in  the  interval  (0,11 


with  probability  one. 
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3.  The  Simulation  Technique.  The  Monte  Carlo  study  conducted  here  involves, 

as  a  first  step,  the  generation  of  both  univariate  and  bivariate  normal  random 

samples.  For  this  purpose,  we  first  generate  independent  random  variables 

2 

distributed  uniformly  over  the  interval  (0,1).  Let  X,  =  (-2  lop,  U, )  cos(2mU„), 

=  (-2  log  e  U^)2  sin(2TrU2).  Then  it  is  well  known  that  X  and  X?  are 

independent  standard  normal  random  variables.  To  generate  a  sample  (Y  ,Y  ) ' 

from  a  normal  population  with  dispersion  matrix  £,  we  take  (Y^,Y  )'  =  T(X^,X^), 

where  T.  =  TT'.  To  examine  how  accurately  the  computer  is  approximating  sampling 

from  a  uniform  population,  87  samples  of  10,000  observations  each,  were  generated 

in  a  first  run,  and  44  samples  of  10,000  observations  each,  were  generated  in  a 

second  run.  A  chi  square  goodness  of  fit  test,  with  9  degrees  of  freedom,  was 

2 

carried  out  for  each  of  the  samples,  and  the  probabilities  of  the  X  were  about 
0.45  in  both  cases. 
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4.  Description  of  Computer  Input  and  Output.  The  set  of  input  parameters 
consists  of  n^,  n^,  n^,  ° ^  anc*  ^s  no'te^  earlier,  a  was  always  taken 

equal  to  1,  and  interchanging  the  values  of  n^  and  n^ ,  serves  to  avoid  any  loss 
of  generality  that  may  stem  from  this  choice  of  o  .  The  following  table  shows 
the  different  choices  of  the  set  of  values  of  the  sample  sizes  (n^,  n  ,  n3)  in 
our  Monte  Cano  study. 


Table  1.  Samnle  size  values. 


ni  n2  n3 


nl  n2  n3 


Group 


100 


0 


For  each  chosen  set  of  values  for  the  input  parameters,  500  samples  were  drawn, 


and  at  the  end  of  each  sample  the  estimators  c  (i,j  =  1,2)  are  calculated 

according  to  the  formulas  of  each  of  the  four  methods  under  consideration.  Then 

a  mean  cross-product  deviation  matrix  V  was  calculated  and  printed  out  for  each 

method  over  the  500  samples.  Thus  the  (3x3)  matrix  V  =((v 

r  r ,  v  i  ,i ) ,  (  i  ,3  )  ’ 

where  (i,j),  (i',j')  =  (1,1),  (2.2),  and  (1,2),  and  v  ,.  ..  ...  =  (500)"1 

500  -  .  r ,(i ,i ) ,( l ' ,3  ) 

t  f  (a_  ;  4  ~  P..)(5  .  ,  . ,  -  a  ,  , )  }  .  We  also  print  out  | V  | 1// 3  and  (1/3  trV, 

1  r9±9j  ij  1  ,j 


for  each  V.  In  the  beginning  of  the  study  and  for  values  of  (n  ,n2*n  )  of  the 

order  C  and  (1/2)C  we  chose  four  values  of  °22’  namelY  a 27  ~  ^>2,5  and  10» 

and  fine  values  of  P,  namely  p  =  -.  7 ,- . 3 ,+ .1,+ . 5  ,+ .9 ;  which  means  that  each  fixed 


choice  of  (n^  n2,  ng)  was  repeated  twenty  times.  However,  after  careful 

examination  of  the  output  matrix  V,  and  in  order  to  study  the  effect  of  the  values 

-1  -1/2 

of  °22’  eactl  v  was  transformed  to  DVD,  where  D  =  (l,*^  ’°22  Tho  matrix 

DVD  corresponding  to  the  estimator  E^,E  and  clearly  exhibited  its  stability 

or  invariance  with  respect  to  changes  in  the  (scaling)  parameter  022‘  However, 

the  determinant  and  the  trace  of  the  matrix  DVD  for  the  estimator  E  ,  tended  to 

increase  monotonically  with  0  .  It  was  decided  then,  that  only  the  case  0  =  1, 

Z  Z  Z  Z 

should  be  considered  in  the  input,  and  additional  values  of  P  were  introduced  as 


follows:  and  0  were  taken  to  equal  1  in  all  cases;  and  for  values  of 

(nl5n2,n3)  in  the  group  sizes  1/2C  and  <°,  p  took  the  values  -.7,-.3,.l,.5,.7  and 
.9;  and  for  values  of  (n^j^.n^)  in  the  group  sizes  2C,  3C,  4C  and  5C,  the  values 


-.7,-.3,-.l,.l,.3,.5  and  .9  were  assumed  by  P. 

It  may  be  useful  to  make  a  few  remarks  on  the  number  of  solutions  of  the 
2 

equation  f(p  )  =  0.  In  most  cases,  we  found  only  one  root  inside  the  interval 

2 

(0,1).  In  a  few  cases,  we  found  that  f(p  )  had  exactly  three  roots  in  the  unit 


interval.  This  indicated  that  the  solution  of  the  maximum  likelihood  equations 
was  not  unique  (this  situation  occured  only  for  very  small  values  of  fig)*  In 
this  case  we  computed  the  logarithm  of  the  likelihood  function  L,  and  took  the 
root  that  maximized  log  L. 

We  also  carried  out  an  (indirect)  overall  test  for  the  normality  assumption 

concerning  the  samples  obtained  in  this  study.  This  consisted  of  looking  at 

the  first  and  the  second  diagonal  elements  (say,  v^ ,  v  )  of  V  .  Using  the 

assumption  of  normality  of  the  various  samples,  one  can  easily  calculate  the 

mean  and  variance  of  both  v^  and  v^  was  computed.  Next,  we  note  that  v.  (i=l,2) 

ft 

is  the  average  of  500  independent  random  variables,  and  hence  using  v^  = 

[v.  -  E(v^)]/  /varvT,  may  be  considered  as  an  observation  from  a  standard 
normal  population.  The  values  of  v^  considered  in  this  test  arose  from  all  choices 
of  (n1,n2,n3>.  Using  the  cases  a  =  a  =  1  and  p  -  -.7,  -.3,  +.1,  +.5,  +.9, 

tfc  5V 

we  got  120  observations  on  each  of  v^  and  v^.  These  240  observations  were  then 

.  2  .  2 
tested  for  normality  using  a  X  ;  the  probability  of  the  X  exceeding  its 

observed  value  was  less  than  0.45. 

In  the  above  test,  we  could  have  similarly  used  various  other  elements  of 
the  matrix  V.( i=l,2 ,3 ,4) ,  but  we  felt  that  the  two  elements  actually  used  would 
be  sufficient.  Although  the  above  is  not  a  conclusive  proof  of  the  normality 
of  the  data,  it  seems  to  be  one  of  the  best  procedures  that  could  be  used  under 
the  present  circumstances.  A  direct  check  of  the  normality  of  all  the  samples 
would  clearly  have  been  too  expensive.  Furthermore,  even  if  each  sample  were 
tested  individually;  we  could  not  meaningfully  conclude  the  presence  of 


normality  in  general. 
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5.  The  Results  of  the  Study  and  the  Associated  Plots.  The  main  results  of 
this  study  are  depicted  in  a  series  of  plots.  Plots  1  through  4  are  constructed 
as  follows . 

The  different  values  of  the  correlation  coefficient  p  are  indicated  on  the 

horizontal  axis.  The  vertical  axis  exhibits  the  values  of  the  determinant  of 

the  matrix  V(  3  x3)  corresponding  to  the  four  different  estimators  and  the 

different  input  values  of  n..  ,n  ,n„  and  p,  with  o  =  a  =  1.  The  horizontal 

1  z  o  _L  J-  Z  Z 

axis  for  plot  5  corresponds  to  the  total  number  of  observations  (n^  +  n^  +  2n^) 

taken  for  size  orders  (1/2)C,  C,  2C,  3C,  4C  and  5C.  The  points  on  the  vertical 

axis  represent  the  value  of  j V (  (averaged  over  the  different  values  of  p)  for 

the  various  E . (i=l,2 ,3 ,4)  and  the  different  designs  D.,  where  (P  ,D  ,D  ,D  ) 

1  112  3  4 

correspond  to  the  values  of  n  =  (n^.n^n.^)  being  (7,3,5),  (10,0,5),  (R:a,2)  and 
(2,2,8),  respectively.  These  values  are  for  n_  of  the  ord°r  (  j./2)C.  Values  of 
n_  of  the  form,  say,  (7,3,5)  and  3,7,5),  belong  to  the  same  design  and  |v| 
was  averaged  over  them. 

By  analyzing  the  computer  output  directly  and  examining  the  preceding  plots, 
one  can  make  the  following  observations. 

(1)  The  comparative  efficiency  of  the  estimators  is  the  same  under  both  the 
determinant  and  the  trace  criterion  for  V.  Thus  we  restricted  attention  in 
drawing  the  plots  to  the  determinant  criterion.  However,  the  determinant 
criterion  is  also  more  meaningful  here,  since  the  parameters  being  estimated 
may  not  necessarily  be  in  the  same  scale. 

(2)  All  estimators  possessed  a  maximum  in  the  neighborhood  of  the  point  p  =  .1 
(one  expects  this  point  to  be  a  =  0),  except  for  the  designs  (8k, 8k, 2k); 

k=l ,2 ,4 ,6 ,8 ,10)  for  which  had  a  minimum  there.  However,  under  trV,  E^ 
always  possessed  a  minimum  at  that  point,  while  F,0,E3  and  E^  still  had  a  maxi¬ 
mum  there. 
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(3)  For  the  HM  design,  F  and  F  coincided,  as  expected  theoretically. 

O  *T 

(4)  For  almost  all  cases,  F  and  F,,  coincided  at  P  =  .1. 

(3)  F  behaved  extremely  badly  for  small  values  of  n^,  and  thus  should  not 

be  considered  in  the  designs  (8k, 8k, 2k).  This  estimator  was  never  the  best, 

and  it  was  always  the  worst,  except  for  very  large  values  of  n ^  and  P  '  .7, 

where  it,  sometime:'.,  became  the  second  best  behind  F  . 

4 

( b )  F0  is  the  best  lor  .  3  <  p  <  .5,  for  p  <  .8  in  the  cast'  of  samples  of 
order  1/?C ,  and  for  P  <  .85  in  (16,16,4). 

(7)  F  is  the  best  for  very  large  values  of  n  and  large  values  of  P. 

(?)  F  performs  its  best  for  small  values  of  p. 

(4)  Aside  from  F^,^  had  the  largest  range ,  being,  larger  than  F  and  F0 
for  small  p,  and  liaving  the  smallest  value  for  P  =  .9. 

(10)  is  the  most  stable  as  a  function  of  P,  followed  by  F^. 

(11)  Differences  among,  the  estimators  are  negligible  for  sample  sines  of  order 
50,  except  F^  in  (80,80,00),  where  it  joins  the  others  only  for  P  N  .6. 

Tn  the  final  analysis,  one  may  conclude  that  for  samples  of  sines  1/1Y 
and  C,  F0  is  highly  recommended  for  all  values  of  P  <  .8.  For  sample  sines 
of  order  kP(k=2,3,4,5),  Fn  is  to  lie  used  except  for  |p|  s  .5,  where  F 
becomes  the  most  efficient.  However,  the  simplicity  of  F0  should  count  heavily 
in  its  favor,  particularly  when  a  small  gain  in  efficiency  by  using,  F  is  not 
very  crucial. 


11 


REFERENCES 


V3 


1.  Hartley,  H.  0.  and  Hocking,  R.  R.  (1971).  Incomplete  data  analysis. 

Presidential  invited  lecture,  ENAR  and  IMS  Meeting,  College 
Park ,  Penn . 

2.  Hocking,  R,  R.  and  Smith,  W.  B.  (1968).  Estimation  of  parameters  in 

the  multivariate  normal  distribution  with  missing  observations. 

J.  Amer.  Stat.  Assoc.  63,  154-173. 

3.  Kleinbsum,  D.  G.  (1970).  Estimation  and  hypothesis  testing  for 

generalized  multivariate  linear  models.  Unpublished  thesis, 
University  of  North  Carolina,  Chapel  Hill,  N.C. 

4.  Roy,  S.  M. ,  Gnanadesikan,  R.  and  Srivastava,  J.  N.  (1970).  Analysis 

and  Design  of  Certain  Multiresponse  Experiments.  Pergamon 
Press,  London. 

5.  Wilks,  S.  S.  (1932).  Moments  and  distributions  of  estimates  of 

population  parameters  from  fragmentary  samples.  Ann .  Math . 

Stat.  3,  167-195. 


12 


Plot 


AM3Q  J°  Jooy  3Tqn3 


tion  Coefficient  p 


310 


170 


Unclassified _ 

DOCUMENT  CONTROL  DATA  ■  R  &  D 


^^^^^^^(SecurUt^t^^ihcmlior^o^ltle^bod)^i^mbatrm^^mn^nd^nn^mnnoimtittnn>u%^)^entere^Ahenth^ove7mU^repor^^^^mt^n^ed^ 
I  OW'OINA  TING  AC  TI VlTv  (Corporal*  muthorj  '  >«.  REPORT  leCyRi^  CL*SStf'CmON 


Department  of  Statistics 

Unclassified 

Colorado  State  University 

Zb  GROUP 

Fort  Collins,  Colorado  80521 

M-2 

S  REPORT  TITL  E 

A  Monte  Carlo  Comparison  of  Four  Estimators  of  the 

Dispersion 

Matrix  of  a  Bivariate  Normal  Population,  Using  Incomplete  Data 

4  OEICNIPTIVE  notes  (Type  ot  report  end  Incluaive  date  a) 

Scientific  Final 

3  au  t nORiSI  (piret  mm,  middle  ini 1 1ml,  Imet  name) 

J.  N.  Srlvastava  and  M.  K.  Zaatar 


•  REPORT  DATE 

February  1972 

7m.  Total  no  Of  PAGES 

26 

76.  NO  OP  REM 

5 

mm.  contract  or  grant  no 

F  3361 5-67-C-l 436 

6.  «mctno  7071-00-12 

c.  DoD  Element  61102F 

a.  DoD  Subelement  681304 

9b.  OTHER  REPORT  NOISt  (Any  other  number  a  thmt  mmy  be  meelgned 
thle  report) 

ARL  72-0032 

10  DISTRIBUTION  STATEMENT 

Approved  for  public  release;  dlstrlbu* 

tlon  unlimited 

ii  luPPLEMENtARy  notes 

TECH  OTHER 

12  SPONSORING  M*LIT  4R»  ACTIVITY 

Aerospace  Research  Laboratories  (LB) 
Wrlght-Patterson  AFB,  Ohio  45433 

Consider  a  random  vector  (X^.X^)  distributed  as  a  bivariate  normal  with  mean 
vector  zero,  and  dlsnerslon  matrix  T.  =  ( (o ^ j ) ) .  Suopose  we  are  given  sanples  of 
sizes  n1  and  n.,,  respectively,  from  the  maroinals  of  X.j,X2,  and  a  sample  of  size  n3 
from  the  bivariate  population  of  (Xj,X2).  Suppose  the  problem  is  to  obtain  a  oood 
estimator  of  z  based  on  the  above  (Incomplete)  sample.  In  this  ,iaoer,  four  esti¬ 
mators  of  £  are  compared  using  Monte  Carlo  methods,  and  it  is  found  that  a  certain 
relatively  simple  estimator  of  l  Is  the  "best"  or  close  to  the  best  In  almost  all 
situations. 


DD 


rot** 

l  NOv  «8 


147 


Unclassified 


3 


Unclassified 


U  S  Government  Printing  Office:  1972  —  759  085/532 


Security  Clt»ufic»uon 


